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Abstract 

We prove that a group has word problem that is a growing context- 
sensitive language precisely if its word problem can be solved using 
a non-deterministic Cannon's algorithm (the deterministic algorithms 
being defined by Goodman and Shapiro in [B]). We generalise results of 
[B] to find many examples of groups not admitting non-deterministic 
Cannon's algorithms. This adds to the examples of Kambites and 
Otto in J ^ of groups separating context-sensitive and growing context- 
sensitive word problems, and provides a new language-theoretic sepa- 
ration result. 



1 Introduction 

The purpose of this note is to extend the results in Sections 6 and 7 of (H]. 
That article described a hnear time algorithm, which we call Cannon's al- 
gorithm, which generalised Dehn's algorithm for solving the word problem 
of a word-hyperbolic group. Many examples of groups that possess such an 
algorithm were provided, alongside proofs that various other groups do not. 

The Cannon's algorithms described in [6] are deterministic, but there was 
some brief discussion at the end of Section 1.3 of [6] of non-deterministic gen- 
eralisations, and the close connections between groups with non-deterministic 
Cannon's algorithms and those with growing context-sensitive word problem. 
(Non-deterministic Cannon's algorithms and growing context-sensitive lan- 
guages are defined in Section [2] below.) It was clear that an appropriate 
modification of the theorems in Section 7 of [6J should imply that various 
groups, including direct products x Fn of free groups with m > 1 and 
n > 1 and other examples mentioned below, had word problems that were 
context-sensitive but not growing context-sensitive. We provide that modi- 
fication in this paper. 



Our examples are not the first to separate context-sensitive and growing 
context-sensitive word problem. For Fm x Fn with m,n > 1 has recently 
been proved context-sensitive but not growing context-sensitive by Kambites 
and Otto in [7], using somewhat different methods; they failed to resolve 
this question for F^ x Fi = Fm x ^ with m > 1, which is now covered by 
our result. And our result has further application, since, according to [3 
Section 7], the fact that F2 x Fi does not have growing context sensitive 
word problem implies that the class of growing context-sensitive languages 
is properly contained in the language class L(OW— auxPDA(poZy, Zog)). We 
refer the reader to [7] for a definition of this class, and for citations. 

In Section [2] of this article, we provide definitions of the classes of context- 
sensitive and growing context-sensitive languages, and of non-deterministic 
Cannon's algorithms. We then show in Theorem [3] that the set of formal 
languages defined by non-deterministic Cannon's algorithms is the same as 
the set of growing context-sensitive languages that contain the empty word. 
Hence the word problem for a group can be solved using a non-deterministic 
Cannon's algorithm precisely if it is a growing context-sensitive language. 

Two different versions of Cannon's algorithms, known as incremental and 
non-incremental, are defined in [6j, and both of these are deterministic. In 
[6l Proposition 2.3] it is shown that the language of an incremental Can- 
non's algorithm is also the language of a non-incremental Cannon's algo- 
rithm. In [71 Theorem 3.7], it is shown that the class of languages defined by 
non-incremental Cannon's algorithms is exactly the class of Church-Rosser 
languages. It is pointed out in [7J (following Theorems 2.5 and 2.6) that 
the class of Church-Rosser languages is contained in the class of growing 
context-sensitive languages. It follows that every group with an incremental 
or non-incremental Cannon's algorithm also has a non-deterministic Can- 
non's algorithm. (This is not obvious, because replacing a deterministic 
algorithm with a non-deterministic algorithm with the same reduction rules 
could conceivably result in extra words reducing to the empty word.) In 
particular, all of the examples shown in [6] to have (incremental) Cannon's 
algorithms have growing context-sensitive word problem. 

It follows from the (known) fact that the class of growing context-sensitive 
languages is closed under inverse homomorphism that the property of a 
finitely generated group having growing context-sensitive word problem does 
not depend on the choice of finite semigroup generating set of G. Some 
other closure properties of the class of growing context-sensitive groups are 
mentioned at the end of Section [2l 

In Section [31 we generalise Theorems 7.1 and 7.2 of [6] to prove that there 
does not exist a non-deterministic Cannon's algorithm for a group satisfying 
the hypotheses of those theorems; that is, we prove the following. 
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Theorem 1 Let G he a group that is generated as a semigroup by the finite 
set Q, and suppose that, for each n > 0, there are sets Si{n), S2{n) in G 
satisfying 

(1) each element of Si{n) can he represented hy a word over Q of length at 
most n, 

(2) there are constants oq > 0, ai > 1 such that for infinitely many n, 
15*4(7^)1 > a^ai, and 

(3) each element of Si{n) commutes with each element of S2{n). 

Then G cannot have a non- deterministic Cannon's algorithm over any finite 
semigroup generating set. 

Theorem 2 Let G he a group that is generated as a semigroup hy the finite 
set Q, and suppose that, for each n > 0, there are sets Si{n), S2{n) in G 
satisfying 

(1) each element of Si{n) can he represented hy a word over Q of length at 
most n, 

(2) there are constants uq > 0,ai > l,a2 > such that for all sufficiently 
large n, \Si{n)\ > aga" and |S'2(n)| > a2n, and 

(3) each element of Si{n) commutes with each element of S2{n). 

Then G cannot have a non- deterministic Cannon's algorithm over any finite 
semigroup generating set. 

As a corollary we see that a group G satisfying the conditions of Theorem [1] 
or [2] cannot have word problem that is a growing context-sensitive language. 

Theorems 7.1 and 7.2 of |6j are followed by a number of further theorems and 
corollaries (numbered 7.3 - 7.12), which provide a wide variety of examples 
of groups that satisfy the criteria of Theorems 7.1 or 7.2 and hence have 
no deterministic Cannon's algorithm. These examples include F2 x Z, braid 
groups on three or more strands, Thompson's group F, Baumslag-Solitar 
groups (a, t, I ta^t~^ = a^) with p ^ ±q, and the fundamental groups of 
various types of closed 3-manifolds. We can conclude immediately from 
our Theorems [T] or [2] that none of these examples have non-deterministic 
Cannon's algorithms or growing context sensitive word problems. 
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Note that a number of these examples, such as F2 x Z and braid groups [5], 
are known to be automatic and hence to have context-sensitive word problem 

m- 

On the other hand the article [6] gives a wealth of examples of groups that do 
have Cannon's algorithms, and hence have growing context-sensitive word 
problems. These include word-hyperbolic, nilpotent and many relatively 
hyperbolic groups. 

We would like to acknowledge the contribution of Oliver Goodman, many 
of whose ideas are visible in the arguments of this paper. 

2 Growing context-sensitive languages and non- 
deterministic Cannon's algorithms 

We start with the necessary definitions. A phase- structured grammar is a 
quadruple {N,X,(t,V), where N and X are finite sets known respectively 
as the non-terminals and terminals, a £ N is the start symbol, and V is the 
set of productions. The productions have the form u — > w with u € and 
V E A*, where A:= NUX. 

The grammar is context-sensitive if |u| < \v\ for all productions u ^ v. It is 
growing context-sensitive if, in addition, for all productions u ^ v, a does 
not occur in v and either u = a or \u\ < \v\. 

As is customary, to allow for the possibility of having the empty word e in 
a (growing) context-sensitive language (defined below), we also allow there 
to be a production cr — > e, provided that a does not occur in the right hand 
side of any production. 

For u,v € A* , we write u — >* v if we can derive v from u by applying a finite 
sequence of productions to the substrings of u. The language of the grammar 
is the set of words w € X* with a ^* w. A growing context-sensitive lan- 
guage (GCSL) is a language defined by a growing context-sensitive grammar 
(GCSG). 

There is some information on this class of languages in pj. Other possibly 
relevant references are [3] and [3]. It is proved in [2] that the GCSLs form an 
abstract family of languages which implies, in particular, that they are closed 
under inverse homomorphisms and intersection with regular languages. This 
in turn implies that the property of the word problem of a finitely generated 
group G being a GCSL is independent of the choice of finite generating set 
for G, and that this property is inherited by finitely generated subgroups of 
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G. 



A non- deterministic Cannon's Algorithm (NCA) is defined to be a triple 
(X, A, 7^), where X C A are finite alphabets, and 7^ is a set of rules of the 
form f — > ti with u,v ^ A* and \v\ > \u\. For a non-deterministic Cannon's 
Algorithm, we drop the restriction imposed in [6] for the deterministic case 
that no two rules are allowed to have the same left hand sides. We allow 
some of the rules f ^ u to be anchored on the left, on the right, or on both 
sides, which means that they can only be applied to the indicated subword 
V in words of the form vw, wv, and v, respectively, for words w & A*. The 
language of the NCA is defined to be the set of words w G X* with w — e. 

The similarity between GCSGs and NCAs is obvious - replacing the produc- 
tions u ^ V oi the former by rules v ^ u the latter almost provides 
a correspondence between them. Apart from the reversed direction of the 
derivations, there are two principal differences. The first is that in a GCSG 
the derivations start with a, whereas with a NCA the significant chains of 
substitutions end with the empty word e. The second is that NCAs may 
have anchored rules, whereas the definition of a GCSG does not allow for the 
possibility of anchored productions. 

These differences turn out not to be critical, and in this section we shall 
prove the following result. 

Theorem 3 Let L be a language over a finite alphabet X with e € L. Then 
L is growing context-sensitive if and only it is defined by a non- deterministic 
Cannon's Algorithm. 

To handle the anchoring problem, let us define an extended GCSG to be one in 
which some of the productions u ^ v may be left anchored, right anchored, 
or left and right anchored, which means that they can only be applied to 
the indicated subword u in words of the form uw, wu or u, respectively, for 
words w € A*. Note that we do not allow productions with u = a to be 
anchored, and neither is there any need to do so, because they can only be 
used as the initial productions in derivations of words in the language. 

The following proposition tells us that allowing anchored productions does 
not augment the class of GCSLs. 

Proposition 4 If a language L C X* is the language defined by an extended 
GCSG, then L is also defined by a standard GCSG. 

Proof: Suppose L is defined by the extended GCSG {N,X,a,V) and let 
A:=NUX. 
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We first replace the grammar by one in which u G N'^ for all productions 
u ^ V. In other words, no terminal occurs in the left hand side of any 
production. 

To do this, we introduce a new set X of non- terminals in one-one corre- 
spondence with X. For a word w G A*, let tD be the result of replacing 
every terminal x in w; by its corresponding non-terminal x. We replace each 
production -u ^ v by a collection of productions of the form u ^ v', where 
v' ranges over all possible words obtained by replacing some of the terminals 
X that occur in v by their corresponding non-terminals x. This achieves the 
desired effect without altering the language of the grammar. 

After making that change, wc introduce three new sets of non-terminals, 
N~ and "A^", each in one-one correspondence with N. For a word w € A+, 
we define ~w as follows. Let w = xv with x e A, v e A*, li x e N \ {a}, 
then we set ~w = {''x)v, where ~x is the symbol in "TV that corresponds 
to x. Otherwise, if a; € X U {cr}, we set ~w = w. We define w~ and 
similarly. (Note that the new symbols in "N^ are only needed here when 

H = 1.) 

Now each production of the grammar of the form a ^ v is replaced by 
a — > "v". For each non-anchored production u ^ v with u ^ a, we keep 
this production and also introduce new (non-anchored) productions ~u 
"v, u" v~ . and ^u" — > "f". Each left anchored production u v \s 
replaced by the two productions "u — > and ~u~ — > "u and similarly for 
right anchored productions. A left and right anchored production u ^ v \s 
replaced by the single production ~u~ — ^ ~v~ . 

The effect of these changes is that the symbols in ~A^ can only occur as the 
leftmost symbol of a word in a derivation starting from a and, similarly, 
those in A^~ can only occur as the rightmost symbol. Conversely, in any 
word w ^ a that occurs in such a derivation, if the leftmost symbol of ttJ is a 
non-terminal then it lies in "A^ , and similarly for the rightmost symbol. The 
symbols in ^N" can only arise as the result of an initial derivation of the 
form a — ^ "u" with \v\ = 1. So the productions that were initially anchored 
can now only be applied in a production at the left or right hand side of 
the word, and so we have effectively replaced anchored productions by non- 
anchored ones that behave in the same way. Hence the language defined by 
this grammar is the same as that defined by the original grammar. □ 

A natural question that arises at this point is whether anchored rules in 
a NCA can be dispensed with in a similar fashion. The following simple 
example shows that this is not possible. Let X = {a;}. Then L := {a;} is the 
language of the NCA with A = X and the single rule x ^ e that is left and 
right anchored. A NCA without anchored rules recognising L would have 
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to contain the rule x — > e, but then L would also contain for all n > 0. 
However, the proof of Theorem [3] that follows shows that we can make do 
with non-anchored rules together with left and right anchored rules of the 
form 

Proof of Theorem [3} Suppose that L C X* is the language defined by 
the GCSG {N,X,a,V) and that e G L. We define a NCA (X,A,7^) with 
K = X \J N \ {cr}, where the rules TZ are derived from the productions V as 
follows. Productions of the form cr — > u with v ^ e are replaced by left and 
right anchored rules v — s- e. All productions u ^ v with u ^ a are replaced 
by the rule v ^ u. It is easily seen that derivations of words in L using V 
correspond exactly, but in reverse order, to reductions of words to e using 
TZ, so the language of {X,A,TZ) is equal to L. 

Conversely, suppose that L is the language of the NCA {X,A,TZ). Then we 
define an extended GCSG {N, X, a, V) as follows. We introduce cr as a new 
symbol and put N := [K\X) U {a}. We make a — > e a production of V, and 
the remaining productions are derived from the rules IZ as follows. Rules 
of the form t> — > n with u ^ e are replaced by productions u ^ v, where 
anchored rules are replaced by correspondingly anchored productions. 

For a non-anchored rule u — > e, we introduce non-anchored productions 
X ^ XV and x — > for all x G A, together with a production a ^ v. For 
a left-anchored rule v — > e, we introduce left- anchored productions x ^ vx 
for all X G A together with a production o" — > v. Right-anchored rules of 
this form are handled similarly. Finally, for a left and right anchored rule 
— > e, we introduce only the production a ^ v. 

Again there is an order-reversing correspondence between reductions to e 
using TZ and derivations using V, so the language of this grammar is equal 
to L and, by Proposition U we may replace it by a standard GCSG with 
language L. □ 

We saw earlier that the property of a group G having growing context word 
problem is independent of the chosen semigroup generating set of G, and 
is closed under passing to finitely generated subgroups. It is proved in [6l 
Theorems 2.11,2.13] that the property of G having a deterministic Cannon's 
Algorithm is preserved under taking free products and under passing to over- 
groups of finite index. These proofs work equally well for non-deterministic 
Cannon's algorithms, and so having growing context sensitive word problem 
is also closed under these operations. But we shall see in the next section 
that F2 X F2 does not have growing context sensitive word problem. Hence 
we see that the class of groups with growing context sensitive word problem 
is not closed under direct products. 
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3 Groups without non-deterministic Cannon's al- 
gorithm 



This section is devoted to the proofs of Theorems [T] and [21 which extend 
Theorems 7.1 and 7.2 of [6j. 

Our proofs of Theorems [1] and [2] are modifications of the original proofs 
in [6]. We assume the reader is famihar with that work and has it available 
for reference. We will show how to modify those arguments so that they can 
be extended to the non-deterministic case. 

The argument of [6j starts by examining the reduction of a word wq to Wn 
by repeated applications of the length-reducing rules, which are referred to 
as the history to time n of wq- These words are then displayed laid out in 
successive rows in a rectangle. The place in each row where a rule is to be 
applied is marked with a substitution line. The letters resulting from the 
substitution occupy the space on the next row below this line and are each 
given equal width. The authors introduce the notion of a splitting path which 
is a decomposition of such a rectangle into a right and left piece, together 
with combinatorial information on that decomposition. Given two histories, 
vq, . . . ,Vr and wq, . . . ,Ws, if these have equivalent splitting paths, then the 
left half of the first rectangle can be spliced together with the right half of 
the second rectangle in a way which produces the history of the reduction 
starting with v^Wq and ending with v~wf . (The super-scripts denote the 
left and right halves of these words.) The combinatorics of splitting paths 
are such that in certain key situations, exponentially many cases are forced 
to share only polynomially many equivalence classes of splitting paths. The 
hypotheses of Theorems 7.1 and 7.2 of [5] assure a supply of exponentially 
many commutators, each of which must reduce to the empty word. One 
shows that two of these can be spliced together to produce a word which 
does not represent the identity, but which also reduces to the empty word. 
This is a contradiction. 

Essentially we are able to work with the same definitions of histories, split- 
ting paths and their details, and equivalence of splitting paths as |6], but 
need to introduce a definition of equivalence of histories, and re- word and 
re-prove some of the technical results involving these concepts. With those 
revisions, we shall see that minor variations of the original proofs verify the 
non-deterministic versions of the theorem. 



We now describe how to modify the proofs of Theorems 7.1 and 7.2 of [6] to 
prove Theorems [1] and [2j 

Given a non-deterministic Cannon's algorithm and a word wq, there is no 
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longer a unique history to time n of wq. We can call any sequence of words 
wo,wi,...Wn produced as the algorithm makes n substitutions on wq a 
history, although it is no longer valid to call it the history. 

The definitions of a "diagram" , a "substitution line" , and the "width" of a 
letter need no modification, nor do Lemmas 6.1 and 6.2 of [6] which relate 
the width of a letter to its generation. 

The definition of a "splitting path" needs no modification. Lemma 6.4 of [6] 
states that a letter of generation g has a splitting path ending next to it of 
length at most 2(7 + 2. This remains true if we choose the history appropri- 
ately. We now show that we can do this. 

Observe that in the non-deterministic case if a word w contains as disjoint 
substrings two left-hand sides, say u and u' of the rules u — > f and u' ^ v', 
then these two substitutions can be carried out in either order, i.e., either 
as 

xuyu'z — > xvyu'z — > xvyv'z 

or as 

xuyu'z xuyv'z xvyv'z. 
Now consider two histories, 

hi = Wo, . . . , xuyu'z, xvyu'z, xvyv'z, . . . ,Wn 

and 

^2 = Wq, . . . , xuyu'z, xuyv'z, xvyv'z, . . . , Wn- 

(Corresponding ellipses stand for identical sequences.) We will say that 
these are equivalent reduction^ and this generates an equivalence relation 
on reductions starting with wq and ending with Wn. We can then speak 
of corresponding substitutions in equivalent reductions. Notice that corre- 
sponding substitution lines in equivalent reductions have the same width, 
occur at the same position horizontally, consume the same letters with the 
same widths and generations and produce the same letters with the same 
width and generation. 

Given a history wq, . . . ,Wn, there is a partial ordering of its substitutions 
which is generated by the relation si -< S2 if S2 consumes a letter produced 
by si. The relationship ~< is visible in the diagram of the history in that 
si -< S2 if and only if there is a sequence of substitution lines with horizon- 
tally overlapping segents starting at si and ending at S2- In particular, ^- 
incomparable substitution lines do not overlap in horizontal position, except 

^The notion of equivalent reductions is not to be confused with the notion of equivalent 
histories defined below. Accordingly, we will briefly refer to histories as reductions to 
distinguish these equivalence relations. 
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possibly at their endpoints. (We will omit further mention of this possible 
exception.) Because of this, given two ^-incomparable substitutions si and 
S2 we either have si lying to the left of S2 or vice versa. Notice that if si lies 
to the left of S2, then this is so for the corresponding substitutions in any 
equivalent reduction. 

Lemma 5 Suppose that hi = wq, . . . ,Wn is a reduction containing the sub- 
stitutions si and S2 in which si takes place before S2 and si and S2 are 
-{-incomparable. Then there is an equivalent reduction /12 in which the sub- 
stitution corresponding to si takes place after that corresponding to 82- 

Proof: Note that si and S2 do not overlap horizontally. Thus, if these two 
substitutions take place at successive words of /ii, we are done. 

Suppose now that no ^-ancestor of S2 takes place later than si. In that 
case, si can be interchanged with the immediately preceding substitution, 
thus reducing by 1 the number of substitutions occurring between si and 
S2. Continuing this in this way produces the previous case. 

Finally, suppose there is S3 ~< S2 with S3 occurring later than si. Let us 
suppose that S3 is the earliest such. Then si and S3 are ^-incomparable, 
for otherwise we would have si ^ S2. Applying the previous case allows us 
to move S3 prior to si, thus reducing by 1 the number of ^-ancestors of S2 
lying between si and S2. Continuing in this way produces the previous case. 
□ 

Corollary 6 Suppose that hi = wq, . . . ,Wn is a reduction and that Si and 
T,2 are disjoint sets of substitutions in hi with the property that no substi- 
tution 0/ Si is -(-comparable with any substitution 0/S2. Then there is an 
equivalent reduction /12 in which every substitution 0/ Si takes place before 
every substitution 0/ S2 . □ 

In view of this, by passing to an equivalent reduction, we may assume that 
if si and S2 are ^-incomparable and si takes place to the left of S2, then si 
takes place prior to S2. 

Using this assumption on our choice of history justifies the statement in 
the proof of Lemma 6.4 that, "When this happens it can only be with 
substitutions to the left in the upper half and to the right in the lower." 
The proof now goes through as before. 

The definitions of "details" and equivalence of splitting paths need no mod- 
ification. Remark 6.5 of [6] gives a bound on the number of equivalence 
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classes of length n. This remains valid. (However, we could simplify the 
details by ceasing to record the W — 1 letters to the left/right in a right/left 
segment as part of the details. These are only needed to ensure that substi- 
tutions take place in the intended order in the deterministic case in Lemma 



Lemma 6.6 of [6j states that if two histories, vq, . . . ,Vr and wq, . . . ,Ws have 
equivalent splitting paths then these can be spliced to form the history 
starting with v^Wq and ending with v^wf. In our case, we need to modify 
the statement of Lemma 6.6 of [6] to say "Then a history of VqWq up to a 
suitable time ..." rather then "'the history", because this history may not be 
unique. With that change. Lemma 6.6 remains true. 

The paragraph after the proof of Lemma 6.6 no longer applies at all; that 
is, Vr is not necessarily determined even in a weak sense by vq. 

Section 6.1 adapts these methods to respect subword boundaries. Lemma 
6.7 of ^ says that if wq is a subword of vq of length and wt has length 
at least 2W — 1, then the reduction of wq to wt (and hence the reduction of 
Vq to vt) has a splitting path in one of at most CiN^"^ classes. The results 
of this section up to and including this Lemma remain valid. 

Lemma 6.8 of [B] discusses the way that the choice of uq affects the subword 
reduction of vq in the word uqVq. This does not make sense as stated for 
a non-deterministic algorithm, because reduction of uqVq to utVt no longer 
gives the word vt as a function of uq. 

In order to state an analogous result, we need an appropriate notion of 
equivalence of histories. For fixed vq and variable uq, we define two histories 
■^o^^o ~^ UtVt and u'qVq u[v[ to be equivalent if either: 

(i) l{vt) < W and vt = v[; or 

(ii) l{vt), liv'-f.) > W, the first W — 1 letters of vt and v'^ are the same, and the 
two histories have equivalent splitting paths that begin at the same place in 
Vq and end immediately after the first W — 1 letters of vt and v'^. 

Then it follows from Lemma 6.6 of [6] that, if uqVq — > utVt and u'qVq — > u[v[ 
are equivalent histories, then there is also a history u'qVq — > u[vt, to which 
both are equivalent. (Note: this notation may seem to imply that all of 
these histories have the same length t, but of course they need not. All 
three lengths might be different!) 

Then the proof of Lemma 6.8 of [B] shows that, for fixed vq of length N, 
there are at most CqN^ equivalence classes of histories uqVq — > utVt for 
variable uq. 



6.6.) 
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We turn now to the proof of our Theorem [H As we have already seen, 
the equivalent properties of having growing context sensitive word problem 
and having non-deterministic Cannon's algorithms hold independently of 
the finite semigroup generating set Q, so we only need to prove the non- 
existence of the Cannon's algorithm over Q. We can further assume that 
1 € ^ so that any element which is represented by a word of length less than 
or equal to n is also represented by a word of length n. (It is also easily 
seen directly that the hypotheses of Theorems [U and [2] do not depend on 
the choice of generators.) Assume for a contradiction that the hypotheses 
of that theorem hold and that there exists a non-deterministic Cannon's 
algorithm over a working alphabet A that contains Q. 

First we choose a specific value of n > 3W that is large enough for this 
stronger version of (1) to apply, 15*4(7^)1 > ao"i! and also such that n is 
big enough so that that this exponential function is larger than a particular 
polynomial function that comes out of some of our technical lemmas. More 
precisely, we require 

where C, Cq, Ci and C2 are the constants defined in Lemmas 6.7 and 6.8. 

For i = 1, 2, let Tj be a set of words of length at least 'iW and at most n 
representing the elements of Si{n). Since each element of Si{n) commutes 
with each element of 52(n), we have uqVqUq^Vq^ =g 1 for all uq E Ti and 
^0^22, and hence this word can be reduced, not necessarily uniquely, to 
the empty word by means of the Cannon's algorithm. For ease of notation 
we write xq for Uq"^ and yo for v^"^ . For each such uq and uq, we choose 
some sequence of substitutions that reduces uo^^o^^oyo to the empty word 
and let utVtXtVt be the word that we get after applying t substitutions in 
this sequence. 

For such a commutator, we run the algorithm to the point where for the 
first time vt and xt both have length less than 2>W . At that time the longer 
one has length in the range [2W, 'iW — 1]. Note that t depends on uq and uq, 
and where it is used below it should seen that way, and not as a constant. 

First we assume that for at least half of the pairs {uq^vq) G Ti x T2, we have 
K^t) ^ K^t)- We shall deal with the opposite case later. 

Step 1. Now we fix a f £ ^2, chosen such that l{vt) > l{xt) for at least 
half of the words uq £ Ti, and we let U be the set of all words uq with this 
property. Then \U\ > ^aoa". 

Step 2. Since we have l{vt) > 1W , we can apply Lemma 6.7, which says 
that we can choose a splitting path for the subword history , , • • • ft in 
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one of polynomially many equivalence classes (Cin^, to be precise). That 
is to say there are polynomially many sets of details that can describe such 
a splitting path. This remains true (with the number increased to CinP'^'^'^) 
if we add to the detail the information that tells us where within vq the 
splitting path begins and where within vt the splitting path ends. We call 
that the extended detail. 

Step 3. Now since vq is fixed we have a well defined map uq i— > vtXtUt- 
(Recall that, although there may be many possible reduction sequences for 
UQVoXoyoi we arbitrarily chose some fixed sequence for each uq-) Now we 
apply our amended version of Lemma 6.8 with uqVqXq in place of mq and yo in 
place of Vq. This tells us there are at most polynomially (Con*^ in fact) many 
equivalence classes of histories uqVqXqUq utVtXtyt- But uq comes from a 
set of exponential size (at least ^a^a^), which we have chosen to be bigger 
than the appropriate polynomial, through our choice of n. So we have a large 
set (of size more than Cin'^^+s^ of € C/ that give rise to the same words vt 
and xt, and with the property that the histories uo^^o^^oyo ut{uo)vtXtyt{uo) 
(where vt and xt are fixed, but ut and yt depend on uq) are all equivalent, 
in the sense defined above in our comments about the amended Lemma 6.8. 

But, as we also noted above. Lemma 6.6 implies that these histories are 
also all equivalent to histories uo^^o^^oyo — > ut{uo)vtXtyt for the same fixed 
yt- Note that in these two equivalent histories uoVQXQyo —i- ut{uo)vtXtyt{uo) 
and UQVoXoyo — > ut{uo)vtXtyt, the parts of the two histories to the left of 
the splitting line are the same except for the number of steps in which the 
words remain constant. So we can use essentially the same splitting paths 
as we chose in Step 2 for the second history. 

Step 4- Since the number of uq giving rise to equivalent histories in Step 3 is 
greater than the number of equivalence classes of positioned splitting paths 
for Vt in Step 2, we can choose uq, Uq & U such that uo^^o^^oyo utVtXtyt and 
UoVQXQyo — > u[v[x'ty[ are equivalent histories with v[ = vt and = x^, and 
such that the subhistories vq vt and vq — > v'^ in the two histories contain 
equivalent splitting paths, which start at the same position in vq and end in 
the same position in vt = v[. 

By the remark above, the second of these histories (and hence also the 
first!) is equivalent to a history UQVQXQyQ u'^v'-^x'^yt, which still contains an 
equivalent splitting path through vq v[. Now we can do our splicing, and 
apply Lemma 6.6 of p] to produce a history uqVq v'^ x'^y'^ —>■ utv^ v'^ x[yt = 
UtVtXtyt- 

But UtVtXtyt is part of the originally chosen history that reduces the com- 
mutator UQVQXQyQ to the empty word, so there exists a history that reduces 
UQU^v'^x^yQ to the empty word, a contradiction, because this is not the 
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identity element of the group. 

In the second case (not considered in detail in Section 7 of [6J) where l{xt) > 
l{vt) for at least half the pairs {uo,vo) £ Ti x T2, rather than fix vq G 
T2 we fix uq G Ti in a similar way, and let V be the possible vq from 
which, together with the chosen uq, we get l{xt) > l{vt)- Then we apply 
Lemma 6.7 to the subword histories xi,...xt. Now we look at the map 
■^0 ^ utVtXt- The analogue of Lemma 6.8 applied to utVt tells us that ut 
can take polynomially many values, and we see that we get a large set of 
possible Vq £ V corresponding to a single utVtXt- Hence we can choose 
Vq mapping such that UfVtXt = u[v[x[ and such that the subword histories 
xi, . . . xt and x[, . . . x[ have the same extended details, and we can splice. 
Hence we see that the algorithm should rewrite uoVox^x'^Uq to utVix^ x'^ y[. 
Since = Xq, the first of these two words is equal to uqVqXqiIq^ equal in the 
group to uo^^cT^' non-trivial. But the second word is equal to u[v[x'iy[, 
which rewrites to the trivial word. Hence again we get our contradiction. 

Modifying the proof of Theorem 7.2 from [6j in the same way, we arrive at 
a proof of Theorem [2l 

Basically we choose ni,n2 with 

which we can do, for example, by first setting n2 equal to some polynomial 
function in ni so that (2) is satisfied for all ni, and then choosing rii big 
enough so that (1) holds. (But notice that we need Hypothesis (2) in the 
statement of the theorem to be satisfied for these particular values of ni 
and 77.2, which is why we have assumed this hypothesis for all integers n > 
rather than for infinitely many such n, which was sufficient for Theorem [TJ) 

As in the proof of Theorem[Tl for i = 1, 2, we choose Tj to be a set of words of 
length at least 3W and at most n that represent the elements of Si. Those 
conditions will now ensure each ^|Tj| {i = 1,2) is bounded below by the 
appropriate polynomial function of n2, ni which allows us to find uq,Uq. So 
in the case where vt is longer than ut we can do just what we did in the first 
case of Theorem [H since Ti is big enough we can find a big enough set of 
elements of U mapping to the same vtXtVt- 

And in the second case we have T2 big enough, and so we can follow the 
argument used in the second case of the proof of Theorem [T] 
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